Human Object Interaction Detection


Human-object interaction (HOI) detection is a task of identifying a set of interactions in an image, which involves the localization of the subject (i.e., humans) and target (i.e., objects) of interaction, and the classification of the interaction labels.

PhysLab: A Benchmark Dataset for Multi-Granularity Visual Parsing of Physics Experiments

Add code
Jun 07, 2025
Viaarxiv icon

3DFlowAction: Learning Cross-Embodiment Manipulation from 3D Flow World Model

Add code
Jun 06, 2025
Viaarxiv icon

Prototype Embedding Optimization for Human-Object Interaction Detection in Livestreaming

Add code
May 28, 2025
Viaarxiv icon

Locality-Aware Zero-Shot Human-Object Interaction Detection

Add code
May 26, 2025
Viaarxiv icon

Object Concepts Emerge from Motion

Add code
May 27, 2025
Viaarxiv icon

Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

Add code
May 27, 2025
Viaarxiv icon

VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion

Add code
May 25, 2025
Viaarxiv icon

On the use of Graphs for Satellite Image Time Series

Add code
May 22, 2025
Viaarxiv icon

When Bias Backfires: The Modulatory Role of Counterfactual Explanations on the Adoption of Algorithmic Bias in XAI-Supported Human Decision-Making

Add code
May 20, 2025
Viaarxiv icon

Visual Affordances: Enabling Robots to Understand Object Functionality

Add code
May 08, 2025
Viaarxiv icon